智能论文笔记

Semantic uncertainty intervals for disentangled latent spaces

Swami Sankaranarayanan , Anastasios N. Angelopoulos , Stephen Bates , Yaniv Romano , Phillip Isola

分类：计算机视觉 | 人工智能 | 机器学习 | (统计)机器学习

2022-07-20

计算机视觉中有意义的不确定性量化需要有关语义信息的推理 - 例如，照片中的人的头发颜色或街上汽车的位置。为此，最近在生成建模方面的突破使我们能够在分离的潜在空间中代表语义信息，但是在语义潜在变量上提供不确定性仍然具有挑战性。在这项工作中，我们提供了原则上的不确定性间隔，这些间隔可保证为任何潜在的生成模型包含真正的语义因素。该方法执行以下操作：（1）它使用分位数回归来输出潜在空间中每个元素的启发式不确定性间隔（2）校准了这些不确定性，以使它们包含新的，看不见的输入的潜在值。然后可以通过发电机传播这些校准间隔的终点，以为每个语义因素产生可解释的不确定性可视化。该技术可靠地传达了语义上有意义的，有原则和实例自适应的不确定性，例如图像超分辨率和图像完成。

translated by 谷歌翻译

Generate To Adapt: Aligning Domains using Generative Adversarial Networks

Swami Sankaranarayanan , Yogesh Balaji , Carlos D. Castillo , Rama Chellappa

分类：

2017-04-06

Domain Adaptation is an actively researched problem in Computer Vision. In this work, we propose an approach that leverages unsupervised data to bring the source and target distributions closer in a learned joint feature space. We accomplish this by inducing a symbiotic relationship between the learned embedding and a generative adversarial network. This is in contrast to methods which use the adversarial framework for realistic data generation and retraining deep models with such data. We demonstrate the strength and generality of our approach by performing experiments on three different tasks with varying levels of difficulty: (1) Digit classification (MNIST, SVHN and USPS datasets) (2) Object recognition using OFFICE dataset and (3) Domain adaptation from synthetic to real data. Our method achieves state-of-the art performance in most experimental settings and by far the only GAN-based method that has been shown to work well across different datasets such as OFFICE and DIGITS.

translated by 谷歌翻译

MERLIN: Multi-agent offline and transfer learning for occupant-centric energy flexible operation of grid-interactive communities using smart meter data and CityLearn

Kingsley Nweye , Siva Sankaranarayanan , Zoltan Nagy

分类：机器学习 | 人工智能

2022-12-31

The decarbonization of buildings presents new challenges for the reliability of the electrical grid as a result of the intermittency of renewable energy sources and increase in grid load brought about by end-use electrification. To restore reliability, grid-interactive efficient buildings can provide flexibility services to the grid through demand response. Residential demand response programs are hindered by the need for manual intervention by customers. To maximize the energy flexibility potential of residential buildings, an advanced control architecture is needed. Reinforcement learning is well-suited for the control of flexible resources as it is able to adapt to unique building characteristics compared to expert systems. Yet, factors hindering the adoption of RL in real-world applications include its large data requirements for training, control security and generalizability. Here we address these challenges by proposing the MERLIN framework and using a digital twin of a real-world 17-building grid-interactive residential community in CityLearn. We show that 1) independent RL-controllers for batteries improve building and district level KPIs compared to a reference RBC by tailoring their policies to individual buildings, 2) despite unique occupant behaviours, transferring the RL policy of any one of the buildings to other buildings provides comparable performance while reducing the cost of training, 3) training RL-controllers on limited temporal data that does not capture full seasonality in occupant behaviour has little effect on performance. Although, the zero-net-energy (ZNE) condition of the buildings could be maintained or worsened as a result of controlled batteries, KPIs that are typically improved by ZNE condition (electricity price and carbon emissions) are further improved when the batteries are managed by an advanced controller.

translated by 谷歌翻译

MLExchange -- A web-based platform enabling exchangeable machine learning workflows

Zhuowen Zhao , Tanny Chavez , Elizabeth Holman , Guanhua Hao , Adam Green , Harinarayan Krishnan , Dylan McReynolds , Ronald Pandolfi , Eric J. Roberts , Petrus H. Zwart

分类：机器学习 | 人工智能

2022-08-20

机器学习（ML）算法在帮助不同学科和机构的科学社区解决大型和多样化的数据问题方面表现出了增长的趋势。但是，许多可用的ML工具在编程方面要求且计算成本高昂。 MlexChange项目旨在建立一个配备有能力工具的协作平台，该平台使科学家和设施使用者没有深刻的ML背景来使用ML和计算资源进行科学发现。在高水平上，我们针对完整的用户体验，在该体验中，可以通过Web应用程序可以轻松获得管理和交换ML算法，工作流和数据。到目前为止，我们已经构建了四个主要组件，即中央职位管理器，集中式内容注册表，用户门户和搜索引擎，并成功地将这些组件部署到了测试服务器上。由于每个组件都是一个独立的容器，因此可以轻松地在不同尺度的服务器上部署整个平台或其个人服务，从笔记本电脑（通常是单个用户）到高性能群集（HPC）（同时）通过许多用户。因此，MlexChange使用方案使灵活性变得灵活 - 用户可以从远程服务器访问服务和资源，也可以在其本地网络中运行整个平台或其个人服务。

translated by 谷歌翻译

PS$^2$F: Polarized Spiral Point Spread Function for Single-Shot 3D Sensing

Bhargav Ghanekar , Vishwanath Saragadam , Dushyant Mehra , Anna-Karin Gustavsson , Aswin Sankaranarayanan , Ashok Veeraraghavan

分类：计算机视觉

2022-07-03

我们提出了一种依赖工程点扩散功能（PSF）的紧凑型快照单眼估计技术。微观超分辨率成像中使用的传统方法，例如双螺旋PSF（DHPSF），不适合比稀疏的一组点光源更复杂的场景。我们使用cram \'er-rao下限（CRLB）显示，将DHPSF的两个叶分开，从而捕获两个单独的图像导致深度精度的急剧增加。用于生成DHPSF的相掩码的独特属性是，将相掩码分为两个半部分，导致两个裂片的空间分离。我们利用该属性建立一个基于紧凑的极化光学设置，在该设置中，我们将两个正交线性极化器放在DHPSF相位掩码的每一半上，然后使用极化敏感的摄像机捕获所得图像。模拟和实验室原型的结果表明，与包括DHPSF和Tetrapod PSF在内的最新设计相比，我们的技术达到了高达50美元的深度误差，而空间分辨率几乎没有损失。

translated by 谷歌翻译

Mathematical Models of Human Drivers Using Artificial Risk Fields

Emily Jensen , Maya Luster , Hansol Yoon , Brandon Pitts , Sriram Sankaranarayanan

分类：机器学习 | 机器人

2022-05-24

在本文中，我们使用人造风险领域的概念来预测人类操作员如何控制车辆以应对即将到来的道路情况。风险领域将非负风险措施分配给系统状态，以模拟该状态与违反安全财产的距离，例如击中障碍或离开道路。使用风险字段，我们构建了操作员的随机模型，该模型从状态映射到可能的行动。我们在驾驶任务上展示了我们的方法，其中要求人类受试者在逼真的驾驶模拟器中驾驶汽车，同时避免在道路上遇到障碍。我们表明，通过解决凸优化问题，可以获得驾驶数据最有可能的风险字段。接下来，我们将推断的风险领域应用于产生不同的驾驶行为，同时将预测的轨迹与地面真相测量进行比较。我们观察到，风险场在预测未来的轨迹分布方面非常出色，预测精度高达二十秒预测范围。同时，我们观察到一些挑战，例如无法说明驾驶员如何根据道路条件选择加速/减速。

translated by 谷歌翻译

Human Detection of Political Deepfakes across Transcripts, Audio, and Video

Matthew Groh , Aruna Sankaranarayanan , Andrew Lippman , Rosalind Picard

分类：人工智能

2022-02-25

超现实视觉效果的技术的最新进展引起了人们的关注，即政治演讲的深层视频很快将与真实的视频录制无法视觉区分。通信研究中的传统观念预测，当故事的同一版本被视为视频而不是文字时，人们会更频繁地跌倒假新闻。在这里，我们评估了41,822名参与者在一个实验中如何将真实的政治演讲与捏造区分开来，在该实验中，演讲被随机显示为文本，音频和视频的排列。我们发现获得音频和视觉沟通方式的访问提高了参与者的准确性。在这里，人类的判断更多地依赖于话语，视听线索比所说的语音内容。但是，我们发现反思性推理调节了参与者考虑视觉信息的程度：认知反射测试的表现较低与对所说内容的过度依赖有关。

translated by 谷歌翻译

Power Allocation for Wireless Federated Learning using Graph Neural Networks

Boning Li , Ananthram Swami , Santiago Segarra

分类：机器学习

2021-11-15

我们提出了一种数据驱动的电力分配方法，在联邦学习（FL）上的受干扰有限的无线网络中的电力分配。功率策略旨在在通信约束下的流行过程中最大化传输的信息，具有提高全局流动模型的训练精度和效率的最终目标。所提出的功率分配策略使用图形卷积网络进行参数化，并且通过引流 - 双算法解决了相关的约束优化问题。数值实验表明，所提出的方法在传输成功率和流动性能方面优于三种基线方法。

translated by 谷歌翻译

Multi-Fake Evolutionary Generative Adversarial Networks for Imbalance Hyperspectral Image Classification

Tanmoy Dam , Nidhi Swami , Sreenatha G. Anavatti , Hussein A. Abbass

分类：计算机视觉

2021-11-07

本文提出了一种用于处理不平衡高光谱图像分类的新型多假进化生成的对抗网络（MFEGAN）。它是一种端到端的方法，其中在发电机网络中考虑了不同的生成目标损失，以改善鉴别器网络的分类性能。因此，通过将分类器网络嵌入识别函数的顶部，相同的鉴别器网络已被用作标准分类器。通过两个高光谱空间光谱数据集验证了所提出的方法的有效性。同样的生成和鉴别者架构已经与两个不同的GAN目标用于与所提出的方法进行公平的性能比较。从实验验证中观察到所提出的方法优于最先进的方法，具有更好的分类性能。

translated by 谷歌翻译

Programmable Spectral Filter Arrays using Phase Spatial Light Modulator

Vishwanath Saragadam , Vijay Rengarajan , Ryuichi Tadano , Tuo Zhuang , Hideki Oyaizu , Jun Murayama , Aswin C. Sankaranarayanan

分类：计算机视觉

2021-09-29

Spatially varying spectral modulation can be implemented using a liquid crystal spatial light modulator (SLM) since it provides an array of liquid crystal cells, each of which can be purposed to act as a programmable spectral filter array. However, such an optical setup suffers from strong optical aberrations due to the unintended phase modulation, precluding spectral modulation at high spatial resolutions. In this work, we propose a novel computational approach for the practical implementation of phase SLMs for implementing spatially varying spectral filters. We provide a careful and systematic analysis of the aberrations arising out of phase SLMs for the purposes of spatially varying spectral modulation. The analysis naturally leads us to a set of "good patterns" that minimize the optical aberrations. We then train a deep network that overcomes any residual aberrations, thereby achieving ideal spectral modulation at high spatial resolution. We show a number of unique operating points with our prototype including dynamic spectral filtering, material classification, and single- and multi-image hyperspectral imaging.

translated by 谷歌翻译